LOCUS V00451 3400 bp DNA linear PLN 14-NOV-2006 DEFINITION Glycine max leghemoglobin gene or pseudogene (no mRNA detected). ACCESSION V00451 L00005 L00006 VERSION V00451.1 GI:18592 KEYWORDS leghemoglobin; pseudogene. SOURCE Glycine max (soybean) ORGANISM Glycine max Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine. REFERENCE 1 (bases 1 to 3400) AUTHORS Wiborg,O., Hyldig-Nielsen,J.J., Jensen,E.O., Paludan,K. and Marcker,K.A. TITLE The structure of an unusual leghemoglobin gene from soybean JOURNAL EMBO J. 2 (3), 449-452 (1983) PUBMED 11894962 COMMENT On or before Jan 11, 2002 this sequence version replaced gi:170001, gi:170002. FEATURES Location/Qualifiers source 1. .3400 /organism="Glycine max" /mol_type="genomic DNA" /db_xref="taxon:3847" CDS join(363. .460,555. .663,2182. .2286,3065. .3208) /codon_start=1 /product="leghemoglobin" /protein_id="CAA23729.1" /db_xref="GI:313502" /db_xref="GOA:Q42801" /db_xref="HSSP:P02238" /db_xref="InterPro:IPR000971" /db_xref="InterPro:IPR001032" /db_xref="InterPro:IPR009050" /db_xref="InterPro:IPR012292" /db_xref="UniProtKB/TrEMBL:Q42801" /translation="MGAFTEKQEALVNSSFEAFKANLPHHSVVFFNSILEKAPAAKNM FSFLGDAVDPKNPKLAGHAEKLFGLVRDSAVQLQTKGLVVADATLGPIHTQKGVTDLQ FAVVKEALLKTIKEAVGDKWSEELSNPWEVAYDEIAAAIKKAMAIGSLV" exon <363. .460 /number=1 intron 461. .554 /number=1 exon 555. .663 /number=2 intron 664. .2181 /number=2 exon 2182. .2286 /number=3 intron 2287. .3064 /number=3 exon 3065. .>3208 /number=4 ORIGIN 1 TTTTACTCAA ATCAATGATA TATATTTTGG TAACTTTTTT TCTTTTACTT ATAATTTTGT 61 TTACGTTAAA AGTCAAAAAA GAATACATTA AAAAATTAAA AATTCACCGA ACAACTTAAA 121 TTATTTATTT ACTTTGACTA AGTGAAAAAT TACTTGATTA AGTTTTTGAA AAGGTCGTTG 181 TGTCTTCATA ATGCCGATTG ATACGCTCCA CATTCAATAA GCCAAGAGAG ACATATTCAA 241 TAACAATCGC AACAAATTTT TTTTCAGTCT CCAAACCATC TATATAAACA AGTATTGGAT 301 GTGAACTTAT AACTGGATTG AAAATAGAAA TTAAATAACA GAAAATTACA AAAGATCGAA 361 ATATGGGTGC TTTTACAGAG AAGCAAGAGG CTTTGGTGAA TAGCTCGTTT GAAGCATTCA 421 AGGCAAACCT TCCTCACCAC AGCGTTGTAT TCTTCAATTC GTAATTTTTC TCTCTCACCC 481 TATGTTTCCC TTGAGTTGAA AAGAGGTAGT GTACATAATA GTGTCTTTGG TTTGATTAAA 541 AAACAAAATA ATAGGATATT GGAGAAAGCA CCAGCAGCAA AGAACATGTT CTCATTTTTA 601 GGTGATGCAG TAGATCCGAA AAATCCTAAG CTCGCGGGCC ATGCTGAAAA GCTTTTTGGA 661 TTGGTAAGTG TTAGTCAACT AAAATTATAG TTATTTTATG TGATTTTAGG GATGTATACT 721 GGATTAGATT TTAAAAGATT ATTTTAGTGT TTGTATATTT TAAAAATTAT ATAAGAATAT 781 TATGATTTAT TAAATTTTTT TATAAGAATT TTATGATAGT TTAAATTTTA ATGTATTTAT 841 ATCATAAAAT TTNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 961 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1021 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1081 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1141 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1261 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1321 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1381 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1441 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1561 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1621 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1681 NNNNNNNNNN NNNNNNNNNN NNTAGAAAGG GTTTGATAGA CATTTAATAC TAAAAAGTCA 1741 TATAAAATCC ATCAAATTTA TCTTAAAAAG TAGTCTATCG AAATTTGTAT ATTTTTTAAA 1801 TGTTTCAAAA CTTTTTTTTA AAGTTATAAA AAACTTAATT GAATACCACC AAATTTTGTT 1861 GAATTCATTA AAAAAAATTT AAAAATATTA ATTGAATATT ATAAGACTTA TTTATAGCAT 1921 TTAAAAATTC TCATTAAATA CCAGAAAAAA AAATTTATAA TTTTTTTTTA AAATGTTAAT 1981 CCAGTATACT CCGCTCTTAA AATTAAACAT GCACTTATAT AATACTCTTA AAATATCAAT 2041 GAACATTTGT TAAATTGTAT TTTATATCTT TAACATATCT CGTACTAGGG ATTAATAATG 2101 TATAAAATTA TATTAGTATT TCTTGATTAG TTTTCTTTAA TGATCATCTT ATCATATATT 2161 ATGATATTTT TCAAATTGTA GGTGCGTGAC TCAGCTGTTC AACTTCAAAC AAAGGGATTA 2221 GTGGTGGCTG ATGCCACACT TGGTCCTATC CACACCCAAA AAGGAGTTAC CGATCTTCAA 2281 TTTGCGGTAT GATAAATAAA TTGTTATAAT AAATGCATAC TTAAATCTAA CATGGTGTAT 2341 TCTGTAATGA TCATCACTTC TTTTGTTTAG TAATGAATTT ACTAAACATA TAACATTTAA 2401 TGTCCTCAAT ATTATATTAT TGTAATTTTC AATGATTTTT TTAATTAATA ATTCTTTGAG 2461 CAATGTTTAA AGAAATTGAT TTAACACTGC AATAACTAGC TTCCTCTTTC TCAATTTTTA 2521 TATAAATTAT CACATATACT ATGGAAATAA AATAAGCTAA ACTATGTGAT TATTTATTTT 2581 TGGATTAGGT TATTAATAGC ATTTTATAAT CTATTGGCTA TTGCAACGTT GATTGATTAC 2641 TCAAAAAAAA AAAACTTTGA TTGATTATTA ATTATCATTT CTTTTTGCAT TGATTCTCGC 2701 CTGTCGACAT TTTTCTTTGA AGTTAAGATT CAATTCAAAT ATTTTCATTT ATTTATTCAT 2761 ATTTATTAAA GTCAACTTAA TTTTGTTTAT AAATACTATT ATTTTTTTTC TATTTATTGT 2821 ATTAGTATGA AGTAGGTCTC GTAAAAAACA GACAAATTAT TTCATAGTCA ACCAGTATTT 2881 TATTTAATTT TAAATTATTT AATTAAAAAA CACAAAACCT TATTACTTGT ATCAACAGTT 2941 GCTGATTTAT AAACATTACT ACTACAACTT CTTGGGATAT TAATAGCATT ACTGCGAAAC 3001 TTTAATATGA AAATTTAATT ATAAGGAAAA ATGTAAGAGC TAAAACATCA TTATTGATTC 3061 GTAGGTGGTT AAAGAAGCAC TGCTTAAAAC AATAAAGGAA GCAGTTGGGG ACAAATGGAG 3121 CGAAGAATTG AGCAATCCTT GGGAAGTAGC CTACGATGAA ATTGCAGCCG CAATTAAGAA 3181 GGCAATGGCT ATAGGATCAT TAGTATAAAG TCTAGTAGTA ATAAATAAAT TTTGTTTCAC 3241 TAAAATTTGT TATTAACTTC TTGATATAAA TGTCGGTTAC ATTAGGTAAA ATACAGTACT 3301 TGTCTTTGAA TAAACAATAT TAAATTATTT GCCTCAGGGT TTATGTTTAT GAATCACAAT 3361 CGATACTTTA TACATGTTTT AAAATTATTT TAATAAGCTT //